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the Effect of Format and Organization on Extrapolation 

V ^ . ' 

and Interpolation with Multiple Trend Displays 

MARY L. WOLFE and VICTOR R. MAFTUZR Oh^ersity of Delaware 



Fifty-six undergraduates participated in an investigation of 
the effects of format (bar graphs vs, tables) and organization 'of 
data on the speed and accuracy of extrapolation and interpolation 
. with multiple^ nonlinear trend displays. Other variables of interest 

/ , ^ r 

/ were trend direction (increasing, decreasing) ana acceleration (low^ 
voderatef high). Interpolation was relatively fast «hd accurate and 

was unaffected by experijoental^ variables. Extrapolated values were 

1 

overestimated with decreasing trends and underestimated with increas- 
ing *trends» this effect was greatest with highly accelerated trends. 
Response latencies were lowers but accuracy was unaffectedf when data 
were clustered so th^t trends were perceptually salient. 
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Objective? 

It is widely believed that graphs and tables are effective 
medlA for displaying quantitative infonnation. 'Evidence of this 
belief can be found by examining textbooks^ scientific and technical 
journals, business re^rts and similar docuinents. Instructional 
programs, in such diverse fields as geography^ business ai>d economics^ 
matheiftatics, and the physical, biological and social soxences typi- 
cally include the development of skill in the use and construction 
of grafts and tables as a major objective. In addition, exeuoination 
of numerous standardized tests of educational achievement suggest 
thit comprehension of information displayed in graphs and tables is 
a valued ability. 

^'O tasks for which graphical and tabular displays seem 
partteularly well^suited are interpolation (e.g., in the u3e of tables 
of logarithms or cumulative distribution functions) and extrapolation 
te.g.* in the prediction of trends). However* very^ little systematic 
research has apparently been conducted to determine (a) the specific 
skills needed to perform these tasks; (b) the instructional techniques 
which maxi^ze the acquisition of these skills; an^^) the effects 
of selected variables* such as format, visual complexity ^ and organic 
zation of data, on the utility of graphs and tables for transmitting 
quantitative information. 

Hashburne (1927) examined the effect of selet^ed graphical, 
tabular and verbal methods of presenting time-varying .ijuantitative 
information on retention in junior high sch^l students. Ihe 
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critfrion test measured performance on three types of telSt items: 
(a) specific amount items, requiring recall of the price of a 
particular product at a given point in time; (b) static TTOmparison 
items, cabling for comparison df the pricea of two. or more products 
at the same point in time; and (c) dynamic ^con^arison items, 
requiring the comparison of price trends 'over a specific time inter- 
val} Washburne^s data suggest that performance on each item type 
is a£fected by the manner in which the information is displayed * 
(prose, pictograph, or bar graph), the amount^^ information presented, 
and the logical organization of the information within the display; ^ 
However, the results are somewhat equivocal because completion 
(recall) items were used to assess specific amunt leamingr* while 
multiple-choice (recognition) items were used to measure static 

■ r I 

and dynamic cociparison performance* Schutz (1961a) studied the 
effect of format (vertical bar, horizontal bar and line graph), 
amount of irrelevant data, and amount of missing data on the speed 
and accuracy of performance! on a single-trend detection task. He 
concluded that line graphs were most effective and horizontal bar 
graphs least effective for displaying single trends, and that per- 
formance tended to deteriorate as the amounts of irrelevant and 
missing data increased. In a second experiment (1961b) r concerned 
with methods of displaying Multiple trends, Schutz found that 
multiple-line single^graph displays were superior to single-line 
multiple-graph displays for point comparison tasks. However, Schutz 
did not study the effect of the organization or grouping of data 
on performance with bar graphs — a potentially important factor when 
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this fo^tmat is used to display multiple trqind^ within a single gr^aph 
(e,g,, the annual earnings for eacl\ of four corporations over a. ten- 
year p^iod) , In addition^ neither Washburne nor Schutz investigated 
the relative efficacy of graphs and tables for displaying trends, 

Perspective/Theoretical Fraaiewjj^ pi^^ 

Recent studies Ce,g,, Prase, 1969) hav^^^alt with the prob- 
lem of ^earning prose frocn matrix- structured passages^ where the 
dimensions of the matrix are classes and qualitatively different 
* ^attributes^of these classes. The Inatrix tells^ contain attribute 
' values associated with each of the classes. Information having this 
kirvd of structure- can be organized In severait ways; the two most 
of^n studied &re organization by row and organization by column^ 
Qu^lntitative information^li^ that usually displayed in tables and 
graphs ofteif has a siailar^oiatrV structure. For instance, the 
classes My be coinnoditieSf sucn as different brands of c€realf while 
the attribute dimension may consist of a single quantitative variable 
such as time^ The cells of the matrix then contain values of some 
tlme-v*rying quantity, such ar^it price. This information may be 
organized or clustered in two^ways in a table or bar graph. The 
prices of different brands of cereal v^Y be grouped together for each 
year in the display (org^ization by year) or the varying prices of 
the same brand during successive years may be grouped together so 
- that the pflci^^end for each brand is perceptually salient (organiza 
^tion by brand),, It seems reasonable to predict thin the two types of 
Organization should differentially affect performance on both inter* 
polation and extrapolation tasks. 



Itie major purpose of the present experiment was to exa&u^ 
the effects of format (bar graphs vs, tables) and organization (by \ 
year vs. by brand) on the speed and accuracy of performance on inter- 
polat^n and extrapolation tasks with mul/iiple, non^linear trend 
displays. Other independent variables included in this study were 
trend direction (increasing and decreasing) and -trend acceleration 
(loW| moderate and high) , 

Methods 

Two data matrices were prepared showing the prices in cents 
per bar for each of three different brands of candy during each of 
the years 1930, 1940, 1950/ 1960 and 1970. In one matrix^ the price 
trends were inqreasing; in the others trends were decreasing. For 
the firjst brand, differences between successive prices were If 2, 
3 and 4. (For exanplef in the increasing trend condition, prices were 
ISf 16f 16/ 21 and 25; in the decreasing txen<^ condition, prices were 
27, 26f 24f 21 and 17). For the second and third bfi^dSf differences 
between successive prices were 1, 3, 5, 7 and 1, 4, 7# 11, respectively, 
IhuSf for each brands ^'^Qie'Thcreasing and decreasing trends were 
determined by a "rule" (a regular sequence of successive price 
differences), ^e third trend showed the greatest acceleration (where 
acceleration was positive for the increasing trends negative for the 
decreasing trend). The first trend showed the least acceleration, 

, For each of these data matrices, four stimulus displays were 
pirepared, one for each of the format x organization conditions, the 
first display consisted of a table with the prices grouped by brand; 
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the second consisted of a table with the prices grouped by year. The 
third .and fourth displays were vertical bar graphs with the prices 
grouped by br^nd and by year, respectively. The tables were typed 
oy^Bh'* X 11" sheets of white paper, while the graphs were drawn on 
^^S"* by 14" sheets of white paper. . Subjects, who were tested indi* 
vidually, were randomly assigned in equal numbers to the four format 

X organizatioi^ conditions^ Each subject was instructed to estimate, 

f 

to the nearest whole number, the^ price of each brand of candy in 1980 

and in 1990 (extrapolation) as well as in 1945 and 1965 (interpolation), 

using both the increasing and decr,ea%ing tfend displays. Subjects 

were told to work as quickly and as accurately as possible^ The 24 

extrapolation *and interpolation questions were independently randomized, 
t 

for each subject, each Qf whom recorded his or her responses in a test 
booklets , Response latencies were recorded to the nearest hundredth of 
a second by the experimenter. Separate analyses of variance were 
perfonned on signed error scores and on log latencies. 

6a ta Sourge * ' 

Fifty-six undergraduates enrolled in the College of Education 
'.at the university of Delaware served as volunteers^ They were awarded 
credit toward course, requirements for participation in the experiments 

Wsults and Conclusions 

Error scores wer'e computed by subtracting the correct extra- 
polated or interpolated value (determined by applying the rule for 
generating each trend) from each subject's response (see Table 1 
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for cell Qieans and standard deviations) , Analysis of variance of 
the error scores* using the Geisser-Greenhouse (1958) test with 
conservative degrees of freedom, showed si^ificant main fects of 
trend direction (F = 110,10* df * 1,52; p<,01) and type of task 
(F ~ 8,ldf df ^ 1,52; p< ,01) , Significant interactions were found 
between trend and task (F * 137,^0, df » 1,52? p< ,01), between trend 



and rule (f'^= 78,81, df ^ lr52; p<,01)r and between trends task, and 
rule (F ^ 42,33f df = 1^52; pC,01). Ihere was a consistent tendency 
to overestimate extrapolated prices when the l^rend was decreasing, 
and to underestimate them when the trend was increasing, . When extra- 
polation was performed with the decreasing trendsr 69,1% of the 
responses were overestimated f 11,6% were underestimated, and 19,3% 
were "correct, With the increasing trend, 67:0% of the responses , 
were underestimated, 12,2% were overestimated, and 20,8% were correct, 
Ihe extrapolation error was greatestr for each trend direction, when 
the acceleration was greatest, The latter effect was most pronounced 
when trends were extrapolated to 1990, where mean errors were -13,3 and 
15,6 f(^ increasing and decreasing trends, respectively. In contrast, 
interpolated estimates were relatively accurate regardless of trend 

■ 1 

direction or acceleration! the largest mean error was only 1,'30, 

Analysis of variance of log latencies, again using conserva- 
tive degrees of freedom, showed significamt i|ain effects ,df -organization 
XF ^ B,866^ df-^ 1,52; pi!, 01), task (F = 33,395, df * 1,52; p< .01) 
and rule (F = 36,294, df - 1,52; p< ,01) (see Table 2 for cell means 
and standard deviations). Significant interactiotts were, found between 



format and trend (F = 4,203^ df = 1#52? p^*05), between fonnat and 
rule (F = 4*786# df^^^J-^^^T p^*05) and between trend and rule 
(F = 7*465* df » 1,52; p^*<^) » Organization by brand appeared to 
enable subjects to search both tabular and graphical displays more 
rapidly than organi^tion by year. In addition, interpolation was ' 
performed more quickly than extrapolation, with extrapolation to 
1990 resulting in the longest latency* Response times were longest 
for trends with the greatest acceleration; this effect was most 
pronounced with increasing trends* Although ,^ere was no main 
effect of format on response latency* decreasing trends resulted in 
longer latencies for subjects viewing tabular displays. Moreover, 
the effect of trend acceleration appeared to be greater with tables 
than with graphs. 

The results suggest that the hypothesized facilitating effect 
of organization by brand may be mamifested in response latency 
rather than in accuracy; that is, subjects may be able to attain the 
same precision in less time wtj^fTthe data are clustered in such a way 

that the trends are readily perceived* ihe often-assumed superiority 

f 

of graphs over tjhbles for displaying trends was not supported by the 
results o^^e pre^i^nt experiments perhaps because t^e rules for 
generating the trends were equally appar^t vith either the tabular 
or the graphical format* 

The tendency to over- or underestimate in extrapolation* 
depending on the direction of the trend^'was altogether consistent 
(Mefpigure 1)* One possible explanation o£ thi9:-tendency \is that it 
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may have been an artifact of the way in which the graphical displays 

were constructed. In the case of the increasing trend with'^the greatest 

curvature^ the correct value extrapolated to 1990 was 73/ while the 

highest value on the ^Vertical axis was only 70. That is* subjects 

may have perceived the value of 73 to be 6ff the scale, and lowered 

their estiinates accordingly. However, when 6ne observes that this 

source of bias would not have 4:>een present with the tabular displays, 

or with the decreasing tr^d graphs, this explanation ^^oses much of 

its plausibility. 

Another, more reasonable interpretation is the following. 

Spencer (1961) found that subjects in a graphical extrapolation task 

tended to respond as though a regression line were to be constructed 

through the set of points in the stimulus. If the best-^f itting straight 

line is constructed throu^rh^a set of points which define an increasing 

flection whose graph is concave upward,, that line will underestimate 

extrapolation of the trend* Moreoverr the underestimation will be 

greater for long-term than for short-term predictions, and will increase 

* 

with increasing curvature of the trend. A similar-ar^^ument ^hows thatf 

/ 

a regression line will lead to overestitoation yfi^th a decreasing, 
concave-downward trend. Perhaps sdbjMta the present experiment 

: ^ 

resorted to a similar strategy, particalarly when the acceleration 
of the trend, and distance between the last data point and the point 
to which extrapolation was to be n^de, ccnnbined to make application 
"of the rule particularly difficult. The comparative efficiency with 
%«hich interpolation was performed was reflected in both error and , 
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latency scores, and is ndt surprising. The task essentially involves ' 
locating the midpoint of'l line segment fining two^adjacent points in 
the Cartesiaii plane, and ii^both visually an^ computationally muc^ 
easier than extrapolation, ^ .'\ v 

Educational or Scientific Importance -pf the Study ■ Y\ 

- 1 

The recent interest in the acquisitioh and retention qf infor- ^ 
mation from prose si^^gests that when the information is. quantitative, 
factors influencing the effectiveness of accompanying tab).es and graphs 
should be given careful attention^ The preseiit experiment focused on 
selected, readily-manipulated characteristics of graphs and tables 
themselves, apart from related textual material ^ In view of the 
exploratory nature of the study, definite recommendations regarding ^ 
the construction, selection or use of graphs and tables for instructional 
purposes seems premature^- However, the results suggest that format 
and, particularly, organization should be investigated further, since, 
both seem to play an important role in extrapolation performance, 
especially when the trends depart markedly from linearity^ Identifi-^ 
cat;io;i of those factors which affect the acquisition and retention of 
information ^rom graphs and tables will facilitate stu^ of the useful- 
'liess of such displays as adjuncts in learning from prose, and, in 
development of instructional strategies relevant to the acquisition of 
quantitative information from printed media^ 
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TABLE 1- SIGNED ERRORS 



Increasing 





Extrapola tlon 


Interpolation * 


1980 


1990 


1945 


1965- 


Rl^ R2 R3 


Rl R2 R3 


'Rl R2 . Ft3 


Rl R2 R3 


Bars X 

by 

Year SD 


0,07 -2,21 -3,93 
4,86 2^04 3,97 


-0,29 -6,71 -14,36 
* 

6,09 3,50 7,89 


1,00 0.43 0,14 
3,57 4,4jp 2,93 


1 ,57 0,43 '1, 21 
4,07 1, 34 2.94 


Bars X 

by ' 

Brand SD 


0,64 ^ 'O, 79 -2,50 
2,92 ^,24 4,45 


3,36 -4,93 -11,64 
9,64 6,03 9,65 


2, 29 0,50 0, 79 
4^95 '\ 3, 30 3,07 


0,57 -0,14 -0.36 
3, 34 0.96 3.91 


Tables x 

by 

Year SD 


"0,14 0,64 -4,21 
0,53 5*47 3,95 


-3,43 -5, 29 -12.93 
* 

5.72 4*63 7,52 


0*00 0,64 0,07 
0,00 , 0*50 0,27 


-0,14 0,29 -1*21 
1,03 2,20 3,5i 


Tables x 
by 

Brand SD 


-1,00 -1,93 -2,00. 
1*11 -2,27 2,75 


0,43 -6,43 -^14,14 
8,82 6*88 10*82 


0, 14 -0,93 0,07 
0,53 5,24 0,27 


0,00 -0,07 -0,29 

0,39 \' 1*^^4 1,44 
* ' 
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TABLE 1. SIGHED ERRORS (Continued) 



i 





Decreasing 


Extrapolation 


Interpolation 


1980 J 


1990 


1945 


1965 


Rl R2 R3 


Rl R2 R3 


Rl R2 R3 


Rl R2 R3 


Bars X 

by 

Year SD 


1,,93 2-29 2.86 
5,08 4.70 6.32 


4.93 12.43 13.64 
S.93 10,63 5-'^94 


-1.29 -0.43 -0,64 
2.55 5.81 2.06 


0.14 0-29 0.93 
2-21 3;32 4.78 


Bars X > 
by 

Brand SD 


0-21 2.43 1.86 
1.72 2.53 3.39 


1.93 6.71 16-00 
2.20 3.91 11.14 


-1,93 -1.00 -1. 21 
3.83 * 0.00 , 2.61 


-0-86 -0.71 0.00 
2.18 1-14 8.74 


Tables X 
by 

Year |SD 


0.93 . 2.57 2.71 
2.20 6,82 3.41 


_3y.43 11.21 18.36 
4.57 12.21 16.33 


-0.21 -0-07 0-00 
0.81 0.73 0.00 


0. 31 -0. 29 2.07 
1.18 2.55 4.18 


Tables X 

by 

Brand ,SD * 


0.64 3-07 2.86 
1.78 4.76 4.66 


3.57 11-57 14. 21 
7..07 13.15 10.43 


-0.14 -1-29 0.36 
0.53 4.38 1.08 


-0-21 -2.00 2.21 
0.58 7.23 5.29 
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TABLE 2 . LOG LATENCIES 



1 

■4 


fD — ' ' ( 

Increasing 


Extfalpolatlon 


'Interpolation 


1980 


, 1990 


1^45 


1965 


?R1 Ri 


Rl R2 R3 . 


Rl R2 R3 


Rl R2 R3 * 


Bars 3£ 
by ^ 
Yisar SD " 


0.55 0.58 0.50 

• 


+ . 

0.6O 0.47 * 0.67 


*> CA *> TT *> TC 

0.38 ' 0.^7 a.44 


/ 

0.50 , 0U5 0.58 


Bars 3c 
by 

Brand SD 


2.06 2.48 ^ 2.39 
0.40 0.6& 0.64 


2.46 2.66 2.38 

* * 

a.50 0.68 0.56 


2.17 _ 2.03 2.21 
0.38 0.45. 0.38, 


2.18 2.35 2.16 
0.46 *^0.^9/ ^ 0.53 


Tables k 
by . 

Year SD 


2.28 , 2.90 2.60 
*0.37 0^63 0.58 


2.k4 . 2.90 ' 2.8(3*' 
0.39 0.55 0.54 


^ 2._06 2.41 2.53 
0.44 0.50 0.5.7 


2.33 2.e2 2.70' 
0.49 ,0.61 0.47 


Tables 3C ' 
by 

Brand SD^. 


^.22 :2.38 2.35 
0.41 0.62 0.61 


2.40 ^ 2.66 ^ 2.50 
0.40 0.69 0.65 


2.01 2.21 2.41 
0.36 0.43 0.47 


2.10 2.51 2.41 
0.40 0.63 . 0.33 
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TABff 2- LOG lATfcNCIES 



1 ^ 




Increasing 


1 


Extrapolation 


Interpolation 


I960 


* 1^90 








Rl R2 - R3 


Rl R2 R3 


Rl R2 n3 


by 

Y^r SD 


^«D^ ^.O/ DO ' 

0,55 0-58 0.50 


0.60 0.47 0.67 


^ . 0K> ^ « / / 

0.38 0.57 0,44 


0.50 0.45 0.58 


Bars X 
Brand SD 


— r 

2.06 2.48 2.39 
0.40 0.66 0.64 


2.46 2.66 2.38 
0.50 0.68 0.56 , 


2.17 2,03 2,21 
0.38 0.45 0.36 


2,18 2.35 2.16 
0,46 0.29; 0.53 


Tables X 
Year SD * 


2.28 . 2.90 ' 2.60 
0.37 0.63 0.58 


2.64 2.90 2.66 
0.39 0.55 0.54 


2.06 2.4J. 2.53 
0-44 0.50 0,57 


2.33 2-62 2.70 
0.4? 0.61 0.47 


Tables X 
by 

Brand SD 


J-22 \ 2.38 2.35 
0.41 0.62 0.61 


2.40 2.66 2.50 
0.40 0.69 0.65 


2.01 2,21 2.41 
0.36 0.43 Q,47 


2.10 2.51 2.41 
6.40 0.63 0.33 



TABLE 2* LOG LATENCIES (CONTINUED) 



-a 





I Decreasing ' 


Extrapolation 


Interpolation 


1980^ 


1990 


1945 


1965 


Rl R2 R3 


Rl R2 R3 


Rl R2 R3 


Rl R2 R3 


Bars X 
by 

Y«Ar SD 


2.61 2.56 2*81 
0-61 0-53 0.67 


2*,60 2*73 2*84 
0-39 0*72 0-62 


2.61 2*54 2*57 
0-56 0*51 0-48 


2*62 2-75 2*79 
0-50 0*52 0*54 


Bars X . 
by 

Brand SD 


2-27 2.37 2*31 
0-58 0,50 0,51 


2-37 * 2,66 2*43 
0,62 (i,62 0-68 


2*14 *2*ia 2*08 
0*40 0*37 0*49 


2*14 2-32 2-23 
0-53 0*57 0-46 


Tabled x 
by 

Year SD 


2-70 2,78 2-87 
0-38 0,*7e 0-53- 


2*78 3*06 2-82 
0-61 0-55 0*45 


2-33 2-25 2.54 
0-27 0-37 0-29 


2-63 2-58 2<78 
0-36 0-44 0-48 


Tables X 
by 

Brand SD 


^ 1 ' " 

2-31 2-51 2-46 

0-46 0-62 0-49 


2-59 5*74 2-84 
0-59 0-66 0*69 


2-22 2-27 2-36 
0-51 0*54 0*51 


2-19 2-50 2*55 
0-54 0-65 0*60 
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Fig. 1. Ccnnparison of Mean Extrapolation Performance with Correct Exjtrapolated Response for 
Increasing and Decreasing Trerfds ; 
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